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Abstract 


‘.rJJ 

"Recent  results  for  getting  diffusion  limits  of  a  sequence 
of  suitably  scaled  stochastic  processes  are  applied  to  the 
synchronization  problem  for  a  digital  phase  locked  loop  (DPLL) . 

The  discrete  time  parameter  error  processes  is  suitably  amplitude 
scaled  and  interpolated  into  a  continuous  time  parameter  process. 
For  small  filter  gains  and  symbol  intervals,  a  diffusion  process 
approximation  is  rigorously  obtained.  This  approximation  is  a 
Gauss-Markov  process  and  it  yields  approximate  error  variances, 
passage  time  distributions,  correlation  properties,  (among  other 
properties)  for  the  DPLL.  The  tracking  problem  when  the  clock 
drifts  is  also  treated.  The  technique  is  applicable  to  a  wide 
variety  of  related  problems,  to  get  continuous  time  Markov  systems 
which  are  easier  to  analyze  the  original  (continuous  or  discrete 
time)  systems  which  they  approximate. 
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1.  Introduction 


In  Chapter  9  of  [1],  Lindsey  and  Simon  develop  several 
interesting  digital  phase  locked  loops  (DPLL)  for  the  purpose  of 
symbol  synchronization.  In  theireffort  to  estimate  the  variances 
of  the  epoch  estimation  errors,  it  was  assumed  that  the  DPLL 
adjustment  rate  is  slow  and  the  errors  small.  Then  an  "equivalent" 
phase  locked  loop  (PLL)  was  found,  with  an  "equivalent"  white 
noise  input.  The  error  variance  of  a  linearized  form  of  this  PLL 
was  then  used  as  an  approximation  to  the  error  variance  of  the  DPLL. 
In  the  development  of  such  a  continuous  time  parameter  approximation 
there  are  (or  must  be)  either  implicit  or  explicit  amplitude 
scalings  of  the  signal  and  noise  and  of  the  system  gains.  By 
speaking  of  an  "equivalent  PLL",  and  using  it  to  estimate  the  error 
variances,  there  is  at  least  the  tacit  recognition  that  for  some 
suitable  amplitude  scaling  of  the  error  sequence,  there  is  a  con¬ 
tinuous  parameter  interpolation  of  the  error  sequence  which  is 
close  in  some  statistical  sense  to  the  output  of  the  "equivalent" 

PLL.  But  the  exact  sense  in  which  the  PLL  is  "equivalent"  or  close 
is  not  clear,  owing  to  the  informality  of  the  development  and  the  use 
of  a  spectral  analysis  technique  which  fixed  the  state  variable, 
and  does  not  allow  it  to  vary  naturally.  The  general  idea  is  useful, 
however,  since  owing  to  "central  limit  theorem"  like  effects,  the 
complicated  detailed  structure  of  the  DPLL  would  be  replaced  by 
a  PLL  with  a  white  noise  input,  which  is  easier  to  analyze.  When 
speaking  of  closeness  of  a  DPLL  and  a  PLL,  we  might  mean  that  if 
the  DPLL  were  parametrized  (by,  say,  the  symbol  interal  T  or  by 
a  system  gain),  then  as  the  parameter  converged  to  (say)  zero,  the 
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output  of  the  DPLL  converged  in  a  suitable  sense  to  the  output 
of  the  PLL.  Here,  a  systematic  and  rigorous  way  of  doing  this  is 
developed.  The  technique  has  wide  applicability.  The  specific 
end  results  are  of  the  same  type  as  obtained  in  [1],  except  that 
owing  to  the  "weak  convergence"  nature  of  the  approximation,  much 
information  on  the  DPLL  beyond  the  error  variances  can  be 
(approximately)  obtained  from  the  limit  process. 

Recently  a  very  useful  technique  [2]  has  been  developed  for 
getting  precise  (in  a- sense  to  be  described  below)  diffusion  limits 
of  a  sequence  of  suitably  scaled  (and  suitably  interpolated  into  a 
continuous  parameter  processes)  stochastic  difference  equations.  Here, 
these  methods  are  applied  to  the  synchronization  problem,  and  the 
correct  approximating  diffusion  is  obtained  in  a  mathematically 
rigorous  way.  The  limit  could  conceivably  be  interpreted  as  the 
output  process  of  a  particular  PLL  whose  input  noise  is  white 
Gaussian. +  But  the  important  thing  is  that  it  is  not  necessary  to 
make  ad-hoc  assumptions  in  the  development.  The  method  can  be 
used  to  handle  a  wide  variety  of  structurally  similar  problems 
in  a  systematic  way.  For  specificity,  we  treat  the  scheme  of 
Figure  9.34  of  [1]  under  the  noise  assumptions  there.  See  Figure  1 
for  the  system.  The  same  general  scheme  has  been  applied  to  other 
problems  in  [3];  namely,  to  get  diffusion  approximations  to  the 
"state"  processes  of  a  learning  automata  for  adaptive  telephone 
routing  and  an  adaptive  quantizer.  The  diffusion  approximations 
are  much  easier  to  study  than  the  original  processes.  Related 
"continuous  time"  methods  have  been  applied  in  [4]  to  several 
"continuous  time"  problems. 

+We  do  not  emphasize  this  because  the  limit  equation  is  quite 
simple-and  an  interpretation  is  not  helpful. 
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The  specific  problem,  scaling  and  interpolation  will  be 
developed  in  Section  2.  The  development  is  for  the  simple  case 
connected  with  ([1],  Figure  9.34).  Extensions  to  more  general 
noise,  intersymbol  interference  and  clock  drift  are  discussed  in 
Section  5.  As  will  be  clear,  the  technique  gives  more  information 
than  simply  an  approximation  to  the  error  variance.  In  Section  3, 
the  general  background  theorem  is  given,  together  with  some 
definitions  from  the  theory  of  weak  convergence  of  a  sequence  of 
stochastic  processes.  In  Section  4,  the  theorems  of  Section  3  are 
applied  to  the  problem  of  Section  2,  and  the  main  limit  theorem 
obtained . 
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2.  The  DPLL;  formulation,  scaling  and  interpolation 

The  circuit  is  given  in  Figure  1,  and  Figure  2  gives  the 
timing  sequences.  In  this  section  and  in  Section  4,  the  signal 
sequence  {sn>  is  a  sequence  of  independent  random  variables, 
where  sn  =  ±Aq  and  P{sn  =  Aq}  =  j  »  and  s(t)  =  input  signal  = 
sn  in  the  interval  [nT+6g,  nT+T+fig) ,  where  6^  is  the  unknown 
epoch  which  is  to  be  estimated.  Since  only  the  estimation  errors 


are  important,  with  no  loss  of  generality  we  set  =  0.  The 


input  noise  nT(t) 

given  below.  Let 

2 

oTt.  We  subscript 


is  white  Gaussian,  and  its'  power  will  be 

ft 

wT(t)  =  nT(s)ds  =  Wiener  process  with  variance 

1  Jo1 

nT(-)  and  w,j.(*)  by  T  for  reasons  to  be 


discussed  below  (2.3).  More  general  signal  and  noise  models  will 

A  f  J| 

be  discussed  in  Section  5.  Let  e  denote  the  n  estimate  of 

n 


and  set 


A  = 

n 


<VV'T  ■  VT- 


The  algorithm.  Using  the  two  parameters,  xn_i>xn>  define 
en(*,*)  (see  Figure  1)  by 

(2.1)  enC»n.1>V  *  lsn(1-4-Xn-l>T  *  W4‘Xn>T 

+  wT((n+l+A+An)T)  -  wT((n+A+An.1)T) | 

-  lVA'Xn-l)T  +  sn*l(1‘A+Xn)T 
♦  wT((n+2-A  +  An)T)  -  wTC[n+l-A+An_1)T)  |  . 

Throughout,  it  is  assumed  (as  in  [1])  that  a  <  1/4.  With 
use  of  a  general  finite  memory  linear  filter,  (£n>,  (^n),  *s 
defined  by 


s 


(2.2) 


:  ,  =  e  +  y  I  “.e  .(X  .  ,  ,X  •) 

n+1  n  1  n-i'-  n-i-1’  n-x-' 


X  .  =  X  +  ^  I  a.e  .(X  .  ,X 

n+1  n  T  .“0  l  n-xL  n-i-1*  n-i-'* 


where  Y  >  0.  The  technique  for  (2.2)  is  very  similar  to  that 
for  (2.3).  The  limits  all  have  the  form  (4.1)  and  differ  only  in 
the  power  of  the  input  noise  and  in  9  being  replaced  by  0(1  a • )  • 
We  will  work  with  (2.3)  for  simplicity,  where  g  (X,X')  =  e^(X,X?)/T. 


(2.3) 


n+1 


=  X 


+  l 


n 


T  e  (X 

T  n^  n- 


i’V  - 


x  ♦  if  g„e 


n  n- 1 


,X  ) 


n 


In  any  particular  application,  where  T  is  fixed  a-priori, 
oT  is  determined  from  the  problem  data.  But,  consider  a  sequence 

of  systems  of  the  form  (2.3),  the  sequence  being  parametrized  by 

/\ 

T  -+  0.  Assume,  for  purposes  of  this  argument,  that  e  =  0 ,  and 

fT 

that  SqT  +  J  n^(s)ds  =  yQ  is  used  to  estimate  sQ  via  a  likelihood 
ratio.  Thus,  if  yQ  >  0,  then  Sg  =  Aq  >  0  is  chosen.  Note 


Pichoosing  sq  =  AQ  >  0|  sQ  =  -Ag)  = 


ApT 

oT/r 


dN(0,l), 


where  N(0,1)  is  the  standard  normal  distribution.  Thus,  a  natural 

2  2 

parametrization  is  =  o  T  for  some  constant  o.  Note  that  the 

(noise  power  in  a  bandwidth  of  order  l/T)/signal  power) is 

constant,  under  the  above  scaling. 

As  Y  -*•  0,  the  continuous  parameter  interpolation  (interpolation 
interval  Y)  of  {X^}  converges  to  the  solution  of  an  ordinary 
differential  equation 


which  will  not  yield  the  detailed  path  information  which  is 
desired.  A  further  normalization  is  required  for  this.  It  is 
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convenient  to  write  g  (X  . ,A  )  in  the  form 

®n  n-l  n 

8n(Vl-V  -  K‘-44n-l>sn  *  (4*Xn)sn.l 

+  w (n  +  l'+A+X  )  -  w(n+A+Xn  l)| 

-  IOViK  +  a-A^n)sn+i  +  w(n+2-A  +  Xn)  -  w(n+l-A+Xn .j) 


where  w(‘)  is  a  Wiener  process  with  variance*  o  t. 

Y  Y 

Next,  define  U  =  X  //y  and  define  the  process  U  (•)  by 
n  n 

Y  Y 

U  (t)  =  Un  on  each  interval  [nY,nY+Y).  With  parameter  X,X' 
replacing  xn-i»xn>  define  g(X,X')  =  Egn(X,X’),  and  define 

*n*Xn-l,Xn^  S  gn(An-l,Xn)  *  g(Xn-l »Xn* '  We  can  now  write  the 
normalized  and  centered  iteration  as 


(2.4) 


U 


n+1 


=  U 


n 


y  (g  (X 


n-l 


,xn)//Y) 


/Y 


5Y(x  , 

nv  n-l 


X  ). 

n' 


Define  the  derivative  ^  g(X,X) |^_q  =  '0  •  It  can  be  shown  that 
9  >  0.  For  the  analysis,  it  is  convenient  to  expand  (2.4)  as 

<2-5>  ‘  <  -  1,9 UI  *  *  /rS> 

2  2 

where  vn  are  0(|Xn|  +  |Xnl|  +  |U^-U^_X | )  and  0(x)/|x|  is 
bounded. 

The  limit  theorem  of  Section  4  implies  that  (U  (•)}  converges 
in  distribution  to  a  particular  Gauss-Markov  diffusion  U(0  as  y  -*■  f 

+We  can  define  w(t)  =  Wj(tT)/T. 
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This  limit  would  be  the  output  of  the  "equivalent"  PLL,  and  only 
makes  sense  as  a  "near  equivalence"  if  Y  is  small.  In  the  cited 
section  of  [1],  it  is  also  supposed  that  the  error  estimates 
change  slowly  (small  Y)  and,  in  fact,  that  the  X^  are  "constant" 
over  a  "long  period"  of  time.  The  latter  extraneous  assumption 
is  not  needed  here. 

Properties  of  {X  }  are  obtained  from  X  =  /y  UY  ~  /y  U(nY) 
r  n  n  n 

or,  equivalently,  from  (nY  =  t) 

(2.6)  /Y  U(t)  -  X  j  -  /?  U(nT^). 

Although  the  result  does  not  depend  on  it,  Y  would  normally  depend 
on  T,  and  the  limit  results  suggest  the  appropriate  form  of  the 
dependence.  Since  we  are  concerned  with  the  behavior  of  (Xn> 
over  real  time  intervals  (n:  nT  <  t),  by  (2.6)  wt  should  have 
Y  -*■  0  as  T  ->  0.  If  Y/T  -*■  0  as  T  -*■  0 ,  then  (2.6)  implies  that 
the  system  output  becomes  (Un),  constant  on  any  finite  time 
interval  as  T  ■+  0 .  Let  nT  =  t.  If  Y/T  as  T  then 

X  [  t/T]  /  ~  U(nT*Y/T)  ~  U(°°)  (U(°°)  has  the  limit  distribution, 

as  t  ",  of  U(t)).  In  particular,  let  Y  =  cTa ,  a  <  1.  Then 
the  smaller  is  <*,  the  larger  are  the  errors.  The  best  and  most 
natural  form  in  Y  =  cT.  Then  the  change  of  X^  per  sample  is 
proportional  to  the  symbol  interval  width.  The  initial  error  XQ 
must  be  0(/y) ,  for  otherwise  the  system  (2.3)  will  not  be  able 
to  improve  the  estimate  for  small  Y . 


Via  the  method  of  the  next  section,  it  can  be  shown  that  the 
terms  in  (2.5)  contribute  nothing  to  the  limit.  For  the  sake 
of__simplicity ,  we  drop  _  them  now.  Thus,  henceforth  we  work  with 


the  partially  linearized  form 


(2.7) 


u*  =  -  yQvl  +  «„(*  i 

n+l  n  n  n  n-1  n' 
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3.  Mathematical  Background 

3a.  Remarks  on  weak  convergence  theory.  The  theory  of  weak  con¬ 
vergence  of  a  sequence  of  probability  measures  is  a  powerful  tool 
which  has  found  applications  in  many  areas  of  applied  probability 
[5],  [6],  [9].  Only  a  few  comments  will  be  made  here.  For  a 
full  treatment,  see  [10].  Let  D[0,°°)  denote  the  space  of  real¬ 
valued  functions  which  are  right  continuous  and  have  left  hand 

v 

—  limits.  The  piecewise  constant  process  U  (•)  can  be  treated 

as  an  abstract  random  variable  with  values  in  D[0,°°),  and  it  induces 
a  probability  measure  on  it,  (actually  on  the  sets  of  D[0,°°) 

defined  by  a  certain  topology,  called  the  Skorokhod  topology  -  but 
this  need  not  concern  us  here).  The  sequence  {UY(*))  is  said  to 
be  tight  if  for  each  6  >  0,  there  is  a  compact  set  £  D[0,°°) 
such  that  P{UY(-)  £  Kfi}  >1-6  for  each  6.  The  sequence 
{ UY ( • ) )  converges  weakly  to  a  process  U(-)  if  U(-)  has  paths 
in  D[0,°°)  and  induces  a  measure  P  on  it,  and  if  for  every 
bounded  and  continuous  real  valued  function  F(’)  on  D[0,°°), 

/  F(v)dP^(v)  J  F (v) dP (v)  as  y  +  0.  If  {UY(-)}  is  tight,  then 

each  subsequence  contains  a  further  subsequence  which  converges 

weakly  to  some  process  with  paths  in  D[0,°°).  In  Section  4,  it  will 

be  shown  that  for  our  problem  all  limits  are  actually  the  same  Gauss- 

Markov  process.  The  limit  will  give  us  the  desired  information  about 

y 

the  errors  and  dynamics  of  {U^}  for  small  Y.  Weak  convergence  is 
a  substantial  generalization  of  convergence  in  distribution. 

Theorem  1  below  gives  criteria  for  tightness  and  weak  convergence  to  a 


specific  limit  which  are  readily  verifiable  for  our  problem  directly 
in  terms  of  the  problem  data.  Despite  the  abstract  framework,  the 
techniques  are  readily  usable  on  problems  such  as  the  one  of  Section  2 
and  extensions,  and  the  method  of  proof  of  Theorem  2  illustrates  the 
relatively  straightforward  way  in  which  the  abstract  Theorem  1  can 
often  be  applied. 

3b.  Remarks  on  the  limit  theorem.  Let  B(.)  denote  a  standard 
Wiener  process  (covariance  t)  and  x(-)  the  solution  to  the  scalar 
stochastic  differential  equation 

(3.1)  dx  =  k(x,t)dt  +  v(x,t)dB, 


where  we  suppose  that  k(-,’)  and  v(-,*)  are  continuous  and  that 

v 

(3.1)  has  a  unique  solution  (in  the  sense  of  distributions).  Let  Rn 
denote  the  set  {^J,  j  <  n»  uj,  j  <  n),  and  let  EY  denote  the 
conditional  expectation  given  RY.  Define  the  conditional  "average 
difference"  operator  AY  by  AYf(ny)  =  [EYf(ny+y)  -  f(ny)]/y,  where 
f(-)  is  a  function  which  is  constant  on  the  intervals  [ny,ny+y)  and 
which  depends  on  at  most  RY  at  time  ny.  The  operator  A  defined  by 


(3.2) 


Af (x,t) 


k(x,t) 


\  v2(x,t) 


32f 


is  the  differential  generator  of  the  process  (3.1).  If 


AYf(UY(ny)  ,ny)  -  (A+3/3t) f (UY(ny) ,ny)  0 


(3.3) 


as  Y  -*■  0  for  a  suitably  large  class  of  functions  f,  then,  under 

y 

some  subsidiary  conditions  one  could  conclude  that  U  (•)  -*  x(*) 
weakly.  Unfortunately,  (3.3)  is  hard  to  get  and  does  not  hold  in 
our  case.  Kurtz  [7]  showed  that  if  (3.3)  holds  when  the  left  hand 

y 

f  is  "perturbed"  to  some  f  which  is  close  to  f,  then  under 
some  subsidiary  conditions  the  processes  will  converge  weakly. 

This  point  of  view  was  developed  and  simplified  in  [8],  [2].  Here, 
we  use  the  form  developed  in  [2],  which  is  the  most  convenient  for  the 
purposes  of  this  paper. 

For  purely  technical  reasons  in  the  proof  it  is  convenient  to 
bound  the  process  UY(-)  in  the  manner  given  below.  This  bounding 
is  used  only  in  the  theorem  statement  and  as  a  technical  device 
in  the  proofs.  It  does  not  affect  the  result.  If,  for  each  bound,  the 
sequence  of  bounded  processes  converges  weakly,  then 
the  original  sequence  converges  as  desired.  Let  b^f-)  denote  a 
continuous  function  which  is  zero  in  {x:  |x|  >  N  +  1},  equal  to 

unity  in  (x:  [x|  <  N),  and  is  infinitely  differentiable.  Define 

{U^’N}  by 


(3.4) 


U 


n+1  “  UnY,N  -  ^  CX-l^Vn^* 


Here  X^//y  =  defines  X^.  The  sequence  {|UY,^|}  is 

n  n  n  L  1  n  •  J 

Y  ^ 

stopped  once  it  passes  N  +  1.  Let  U1’  (•)  be  the  piecewise 

constant  process  which  equals  U^’  on  the  intervals  [ny»  ny+y) . 

N 

In  Theorem  1,  for  each  N,  A  stands  for  an  operator  of  the  form 
(3.2)  whose  coefficients  are  continuous  and  equal  those  of  the 

operator  A  in  the  set  (x:  |x|  <  N}.  The  expressions  ’  and 
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AY’N  denote  (resp.)  expectation  conditioned  on  RY’N  =  {UY,N  i  <  n 

n  j  ’  J  -  ’ 

£  T > A  (X^_  ^  jXx1)  ,  j  <  n}  and  the  "conditional  average  difference"  operator 


A  ’  f(ny)  =  [EY’Nf (ny+Y)  -  f(ny)]/Y. 

Theorem  1  is  an  adaptation  of  Theorems  2  and  3  of  [2]  to  our 
problem.  We  use  &  =  set  of  functions  of  (x,t)  with  compact 

support  and  whose  mixed  partial  derivatives  up  to  order  3  are 
cont inuous . 

3c .  The  main  background  theorem. 

Ito J  .  Assume  the  conditions  on  the  coefficients  of  AN  and 
A  given  above t  and  on  the  uniqueness  to  the  solution  of  (3.1).  For  each 

i_nt€?£er  N  and  f  ( *  »  * )  €  3,  suppose  that  there  is  a  sequence  of 
random  functions  fY,N(-)  satisfying  the  following  conditions: 

rY ,N,  , 

i  '  — — constant  on  each  [Yn,Yn+Y)  interval  and  depends  only  on 

Uj  ’  ,  j  <  n,  there .  For  each  N  and  t^  <  °°  (recall 

*1'*  "  UY-N(ny)) 


(3.5) 


(3.6) 


sup  E|fY,N(nY)|  <»,  sup  E | Ay ,Nf Y ,N (nY)  |  <  - ,  ny  <  tn 
n,Y  n , Y  '  u 

E ! fY ,N(ny)  -  f(UY’N,nY)|  -  0, 

E[AY’NfY’N(ny)  -  (|^  +  AN)f  (UY’N,nY)  |  ->0  as  y  -  0, 

and  nY  tn. 
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Then  if  {UY,N('),  Y  >  0}  is  tight  for  each  N,  and  UY,N(0) 
converges  to  some  U(0)  in  distribution  as  Y  +  0 ,  we  have 
UY(‘)  -*•  x(*)  weakly,  where  x(’)  solves  (3.1)  with  x(0)  =  U(0). 

The  sequence  (UY,N(*).  Y  >  0}  is  tight  for  each  N  if  for 
each  tQ<  «>, 

(3.7)  lim  P{  sup  |fY,N(nY)  -  f(UY’N,ny)|  >  <*)  =  0,  each  a  >  0, 

Y-*0  Yn<tQ 

lim  lim  P{  sup  |AY '^f Y ’ ^ (nY) |  >  K)  =  0. 

K>°°  Y>0  Yn<tQ 


(3.8) 
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4.  The  Limit  Theorem 

As  noted  in  Section  2,  the  magnitude  of  the  initial  errors 
must  be  commensurate  with  the  gain  Y.  Otherwise  the  system  of 
Figure  1  will  not  function  for  small  Y  and  T.  For  simplicity 
assume  that  there  is  a  random  variable  Uq  such  that  Aq//y  -*•  Uq 
as  A  ->  0.  In  the  work  connected  with  [1,  Figure  9.34],  it  was 
implicitly  assumed  that  Uq  =  stady  state  solution  to  (4.1)  below. 

Y 

'  Theorem  2.  {U  (•))  converges  weakly  to  the  solution  U(-) 
of  the  Gauss-Markov  equation  (4.1),  jis  Y  0 . 

(4.1)  dU  =  -QUdt  +  vdB ,  u(0)  =  Uq. 

In  (4.1),  B ( • )  is  a  standard  Wiener  process  and  (see  above  (2.4) 
for  the  definitions  of  gn»g)  and  v  is  given  by 

v2  =  2E[gn+1(0,0)  -  g(0,0)][gn(0,0)  -  1(0,0)] 

+  E[gn(0,0)  -  g(0,0)]2  ,  an^  n  >  1, 

2 

Note .  The  form  of  v  is  similar  to  that  obtained  formally  by  the 
method  of  [1,  p.  445-447]. 

Proof.  We  need  only  verify  the  conditions  of  Theorem  1  for 
the  process  (U^’^}  of  (3.4),  for  each  fixed  N.  The  proof  is 

y  N 

relatively  straightforward.  The  systematic  way  in  which  the  f'*  (•) 
are  constructed  is  typical  of  the  method  in  other  problems.  Henceforth, 
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the  test  function  f(#,*)  £  9)  is  fixed  (recall  the  definition  of 

Qi  given  above  Theorem  1)  and  for  notational  convenience,  we  omit 

a  y  m  Y  N 

the  superscript  N  on  everything  except  A  *  and  E^*  .  We  will 

Y  Y 

get  the  perturbed  test  function  f  (•)  in  the  form  f  (ny)  = 
f(UY,nY)  +  fY(nY)  +  f^(nY)  +  f^CnY),  where  the  f?  are  to  be 
chosen  sequentially  such  that  the  conditions  of  Theorem  1  hold. 

Start  by  applying  AY,N  to  f (•,•): 


(4.3)  Y  AY’Nf(U^,nY)  =  Yft(UY,nY)  +  o(y)  -  Yfu(UY ,nY)0UYbN(UY) 


+  /y  f,  (IT  ,nY)b..(Ur)Er  ’  5  r  (A  .  ,X  ) 
u'-  n’  J  Nv  nJ  n  nv  n-1*  n' 

*  Y  fuu(Ufn-Y) 


+  0 


In* 


The  ojn  is  a  remainder  term  in  the  truncated  Taylor  expansion  and 
satisfies 


(4.4) 


,Y  _  PY,N 
In 


<•3/2  1 1  c  Y ^ 


=  E^’  0(Yo'"(Ki;(An_1,An)|  +  1)) 


For  future  use,  note  that  (owing  to  the  properties  of  the  Wiener 
process)  for  each  t^  <  <*>,  and  N  >  0, 

lim  sup  I oY  /y |  ■  0  w.p. 1 . , 

Y^O  nY< tQ 

lim  sup  El oT  /Y |  =  0 . 

Y-*-  0  nY<  t  o  in 


(4.5) 
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All  Oj.n  introduced  below  also  satisfy  (4.5). 

Only  the  first  and  third  terms  on  the  right  of  (4.3)  can  be 

part  of  an  operator  such  as  (3/9t  +  AN)  .  The  4th  and  5th  terms  of 

Y  Y 

(4.3)  depend  on  the  noise  as  weH  as  on  (Un,nV)  and 

Y 

need  to  be  "averaged  out".  The  perturbation  f^  is  chosen  to 
"average  out"  the  f  term.  Define+ 

V  ^  ^,ni^n,n^)  y'tYNY  2  Y  2 

(4.61  sl  m  -  — <ST(Vi-xn»  -  ECS?(VpV>  I 


Y  f  (UT ,nY) 
uu^  n’  ' 


r cY»Nrr"y 


buGI'ME'’  (5  (X  ,  ,X  -  E  (£  (X  .  ,X  ))fc], 
N'-  n-'1  n  '>n'-  n-1*  nJJ  n-1’  nJJ  J 


For  our  particular  problem,  due  to  the  truncation  effects  of  bN(‘)» 

<  A  <_  1/4  for  small  Y  and  the  signal  and  Wiener  process  components 

Y  y  V 

of  £j(*n-l»^n^  (for  1  >  n)  are  independent  of  {£  l ,  j  <  n,  U  j ,  j  <  n}. 

Thus,  the  sum  in  (4.6)  reduces  to  a  single  term.  The  general  (and 

more  complicated  than  needed)  summation  form  for  fj  is  introduced 

here  only  because  it  is  the  appropriate  form  of  fg(ny)  for  the 

generalizations  of  Section  5,  and  will  facilitate  the  discussion 

there.  For  the  same  reason,  f-^.f^  are  introduced  in  a  summation  form 

below,  even  though  for  the  problem  of  this  section,  the  sum  reduces 

to  a  single  term. 


When  expectations  of  the  form  E  5 J (X^_ ^ , Xn) ,  etc.,  are  written,  we 
mean  [E  5j(X,  X  )  J  ,  ;  i.e.,  the  X  ,  ,X  are  treated 

AA|i^i>A  Aj^  ii*‘i  n 

as  parameters  and  considered  to  be  fixed  when  computing  the  expectations 
Also  ^ JCXn_i ,Xn) ,  j  >  n,  is  defined  by  sJ(X,X')  with  X  =  X„_,,  X’  <=  X_ 
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Now,  applying  AY,J^  to  fg(nY)  yields 

Y  A^f0Y(nr)  -  yey-n  bNcuYn, 


(«n.dX„-Xn*l))2  *  E«I*l(Xn'X,.l»‘l 


(4.7) 


"  hHKHEn*KCXn-l>Xn»Z 


-  E£Y  (X  ,X  ))2] 
v  n  ^  n - 1  nJ J  1 


on  the  right 

The  part  of  the  last  term/containing  the  E^’  is  the  negative 
of  the  next  to  the  last  term  of  (4.3).  Also, 


(4.8) 


«t^<Xn-l-V’2  ’  YE<5n‘Xn*X„»2  +  °<Y>  • 


For  future  use  note  that  for  each  t^  >  0  and  i  =  0, 


(4.9) 


lim  sup  |fT(ny)|  =  0  w.p.l,  lim  sup  E |  fT(ny) |  =  0. 
Y-0  ny<tQ  Y-+-0  nY5tQ 


•  Y  Y 

Equation  (4.9)  will  also  hold  for  the  fjjf^  introduced  below. 

Y  Y 

Replacing  U  and  A  by  U  and  A  resp.,  in  the 
n+i  n+i  n  n 

V 

first  term  of  (4.7)  alters  that  term  only  by  a  quantity  02n 

y 

satisfying  (4.5).  In  fact,  02n  is  bounded  by  (4.10).  All  the 
introduced  subsequently  satisfy  (4.5),  but  explicit  bounds  will 
be  given. 


not 
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(4.10)  0(Y3/2)EnY’N[l+|^(*n-i>V  1  (1+l^+l(Xn»Xn)  I  + 

+  °(Y)EY>N|5Y+1(xn>xn+1)|  |w(n  +  2-A-an+1)-w(n  +  2-A  +  An)|  . 

With  ^n+l,Xn+l  replaced  by  UY,Xn  in  the  first  term  of  (4.7) 
that  term  has  the  value  zero,  due  to  the  independence  of  the  in¬ 
crements  of  the  Wiener  process  over  non-overlapping  time  intervals. 

Next,  we  turn  to  "averaging  out"  the  /y  term  of  (4.3).  This 
will  be  done  in  two  steps.  Define 


f/cnr)  •  /v  fu(UnY>nr)bN(unY)  .]  V^Vi-V 


j=n 


■  ^  fufUn-n^bNtUn>En'N5n(Xn-l-i„)- 


Applying  AY,IV(  to  f^nY)  yields 


(4.11)  YAY,NfY(nY)  =  -fY(nY)  + 


Y 

where  the  o3n  is  due  to  the  replacement  of  nY  +  Y  by  ny.  The 
function  fY  satisfies  (4.9).  The  first  term  of  (4.11)  is  the 
negative  of  the  /r  term  of  (4.3).  The  middle  term  of  (4.11)  will  have 
to  be  averaged  further. 
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The  middle  term  on  the  right  side  of  (4.11)  can  be  expanded 


as 


(4.12)  EY,N[f  (UY,nY)b..(UY)  +  (f  (UY,nY)bXI(UY))  (UY-UY)]5Y  (X  ,X  xl)  +  oY  , 
v  J  n  1  n’  J  n'  uv  n*  J  n"u  n+1  nJi  n+lv  n’  n+1'  4n’ 


The  component  of  (4.12)  involving  (f  bjj)  can  written  as 


(4:13)-  Y(fX-"^V»X<’^U<VVl>?X-l'Xn>  *'•« 

■  Y(fu<,nY)bN(uJ)^*^l(Xn,An)^  (!«.,, »„)  *  . 

The  component  of  (4.12)  involving  (f^^)  can  be  written  as 


*  fu<,nY)bN(uJ)E^N^n(Xn,Xn)  *  oJn(Y), 


the  first  term  of  which  equals  zero  by  virtue  of  the  independence  o 
the  increments  of  the  Wiener  process  over  non-overlapping  time 
intervals . 

v 

Next,  define  f£  by 

00  00 


(4.14) 


f>>  '  Y(fuCU>Y)bNCuX  l  I  [Ej’^x(x  >Xn)tJ(X  j.X  ) 

j  =n  k=j+l  J 


f 


By  the  comment  above  (4.14),  the  second  sum  is  zero  (in  the  more 
general  cases  of  Section  5,  it  will  not  necessarily  be  zero). 
Again,  by  the  independence  of  the  increments  of  w(>)  over  non¬ 
overlapping  time  intervals, 
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EI’N«k‘\.-VS!(Xn-i’V  -  Ki<VWVi-V-  (k 


y  i 


j)  . 


for  j  >  n  +  1  and  also  for  j  =  n  if  k>n+l  (in  which  case 
both  sides  of  the  above  equations  equal  zero).  Thus  (4.14)  equals 


(4. IS)  YC^CU^Y^CU^X'^lCVVSX-l-V  - 


It  can  be  shown  that 

YAY’Nf Y(nY)  =  -(4.15)  +  o^+  o(Y). 

One  component  of  (minus  (4.15)) is  the  negative  of  the  principal 
part  of  (4.13),  the  other  component  is  the  "averaged"  centering  term. 

V 

Also,  f2  satisfies  (4.9). 

Summarizing  the  above  calculations  and  recalling  that 
2 

fY(nY)  =  f(UY,nY)  +  1  fY(nY),  for  each  tn  <  °°,  N  <  °°, 

n  i  =  0  u 

(4.16)  lim  sup  E|fY(nY)’  f(uItnY)  |  =  0, 

Y-*0  nY<tQ 

lim  sup  |fT(nir)  -  f(UY,nY)|  =  0  w.p.l. 

Y-*-0  nY<tQ 

av  M  v  v  y 

Also,  taking  advantage  of  the  cancellations  in  A  ’  [f+fp+f^+f ^  \ ,  we 
have 

AY.NfY(„Y)=,  ft(uY,nY)  -  C8Uj)fu<,„Y)bN(u") 

*  7  <fuu<-"7JVu>«X>Xn>>* 


(4.17) 
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Changing  ^n>^n+i  t0  zero  in  the  right  side  of  (4.17)  alters  that 

r-  N 

term  by  0(/y).  Define  the  operator  A  by 


3  N  Y  3  VN  »n^  3^  Y  3  Y 

(ft  *  A  -  lit  *  -B-7 —  *  kX’"Y>lslf'u>1,l 

=  first  four  terms  on  right  of  (4.17),  but  with 
replaced  by  0. 


2  2 

■By  the-  properties  of  b^(-)»  we  have  v^(u,t)  =  v  ,  kN(u,t)  =  -Qu 
when  |u|  <  N.  Finally,  since  the  solution  of  (4.1)  is  unique, 
all  the  conditions  of  Theorem  1  hold  and  the  proof  is  completed. 
Q.E.D. 


22 


5.  Extensions 

5.1  General  noise  and  intersymbol  interference.  Only  an  informal 
discussion  will  be  given.  With  the  appropriate  scaling,  the  method 
is  similar  to  that  of  the  last  section.  Let  sT(t)  +  ^T(t)  denote 
the  input,  where  s,p(*)  =  input  signal,  *^(0  =  stationary  input 
noise  with  zero  mean  value. 

Suppose  that  the  channel  memory  is  given  by  a  function  h^O)- 
To  keep  the  system  from  degenerating  as  T  -*■  0 ,  we  use  hT(t)  = 
h(t/T)  for  some  transfer  function  h(*)>  For  notational  convenience 
assume  that  h(*)  f  0  only  over  a  finite  interval;  in  particular 
let  h(t)  =  0  for  t  >  Q  for  some  integer  Q.  Let  the  waveform 
transmitted  in  the  interval  [iT,iT+T)  have  the  form  s^q(t-iT), 
where  q(u)  =  0  out  of  [0,T]  and  {s^}  is  a  stationary 
sequence.  Then 

[t/T]  ft 

sT(t)  =  I  Si  h(Y-)q(T-iT)dx. 
i=[^]-Q  0 


Define 


ft  f t/T 

ST(t)  *  j^sT(u)du,  S(t)  =  ST(t)/T  =  sT(vT)dv. 


The  noise  model  is  based  on  two  considerations.  First,  for 
simplicity,  we  want  the  process  fj(')  to  have  only  a  finite  memory 
(convenient,  but  not  essential).  Second,  the  considerations 
discussed  below  (2.3)  still  hold  here;  i.e., 


we  want 
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1  2  2  2 

var  yT(s)ds  =  a.T  =  a  T  for  some  a.  To  accomplish  these  aims 

0  1  1  ft 

we  introduce  a  stationary  random  process  Y(*)>  define  T ( t )  =  ^(s)ds, 

J0 

assume  that  there  is  an  integer  R  such  that  for  each  tQ, 

(Y(s),  s  <  tQ}  is  independent  of  ('♦'(s),  s  >  tQ  +  R) ,  and  set 
H>TCO  =  Ht/T). 

The  finiteness  assumptions  connected  with  R,Q,  guarantee 

that  the  sums  defining  f/*  in  Theorem  2  contain  only  a  finite 
number  of  terms  (with  the  signal  and  noise  of  this  subsect  ion  used) . 

The  "tails"  of  these  sums  are  all  zero  by  the  finiteness  and  in¬ 
dependence  assumptions. 
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Next,  define  gn(A,A'),  g(x,x'),  )  as  above  (2.4), 

but  with  the  noise  and  signal  processes  of  this  subsection;  e.g., 
gn(x,x')  has  the  representation 

gn(x,x’)  =  | S (n+l+A+A ' )  -  S (n+A+A  )  +  T (n+l+A+A ' )  -  T(n+A+A)| 

-  |S(n+2-A+A')  -  S(n+1-A+A)  +  y  (n+2-A+A  ' )  -  4*  (n+1 -A+A )  |  . 

Again,  set  -0  =  g(A,A)|^_g  and  suppose  that  9  >  0,  for  otherwise 

the  system  (2.4)  will  be  unstable  for  small  Y  >  0. 

We  will  not  go  into  the  details,  but  the  method  of  Theorem  2 

works  here  also.  Given  f  €  (3t  the  general  (finite)  summation 
Y  N 

forms  of  the  fV  are  used  to  get  the  perturbed  test  function 
fY,N  (recall  that  superscripts  N  were  usually  omitted  in 
Theorem  2).  We  need  to  verify  that  (4.9)  holds,  and  that  (4.5) 

v  n 

holds  for  the  o^  error  terms.  There  can  be  verified  under 

reasonable  conditions  on  ^  (  •  )  •  Assuming  this.  Theorem  2  holds 

2 

but  with  the  first  term  of  v  replaced  by  (the  sum  contains  at 
most  max(Q,R)+l  terms). 

00 

2  l  Ed  +1(0,0)d(0,0). 

k=0  K  1  u 

5.2.  Random  clock  drift.  For  simplicity,  return  to  the  problem 
formulation  of  Sections  1  and  2,  but  suppose  that  the  transmitter 
clock  drifts.  In  particular,  let  the  signal  take  the  value 
s(t)  =  on  lTi>Ti+i)  rather  than  as  [iT,iT+T),  where 
t0  =  60*  Tn+i  *  xn  +  T  +  4n+1»  where  $n  is  a  zero  mean  random 


l 
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variable  such  that  $n/T  is  small.  Write  =  £  6^. 

tn  =  nT  +  6^.  The  system  is  given  by  Figure  1,  and  still 

denotes  the  estimate  of  the  epoch  <5*}.  Set  ^  =  [£  -6^1/T.  We 

use  the  algorithm  (2.3),  (2.4)  which  we  write  in  the  form 


(5.1) 


~n+l 


=  e  +  Ye  (e 
n  nv  n-1 


/T, 


VT> 


The  integrator  dumping  timing  is  still  given  by  Figure  2  but  with 
the  current  definition  of  {en}.  Figure  3  is  merely  a  translation 
of  Figure  2  into  the  "Tn"  notation.  In 

particular, note  that  (n+A)T  +  £  ,  =  AT  +  X  .T+i  -  6  ,  and 

n-i  n-i  n  n 

(n+l-A)T  .  5nl  .  (l-A)T  *  XnJT  *  Tn  - 

Define  g^X^j  ,*„)  =  as  in  Section  2. 

Referring  to  Figure  3,  note  that 


Sn(Vl'V  '  lw(A+An  *WT  -Sn.j/TJ-wtA.X^j.T^/T-S^T) 

*  *  WV4-Vl/T> 

(5-2)  -  |.(w*yVi/T-,,.1/T)  •  i'(1'“1,-i*V,-yT) 

*  s»K‘-Vl*(yVl)/m  * 

8n(^»^')  i-s  defined  as  in  (5.2)  with  parameters  X , X ’  replacing 
*n-l’*n’  resP-  Let  g(*»*')  =  bgn(A,A').  Next  rewrite  (5.1)  as 

(5'3)  Xn.V  Xn  -  Vl'T  *  Y«„CXn-l-\,)- 

Define  by  Snn/T/Y  =  ^  Define  ?Y(X,X'),UY  and  UYtO  as 
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above  (2.4),  but  using  the  of  (5.2).  Let  0  =  "  g(^>^)l\=Q 

and  assume  that  6  >  0.  As  in  Sections  2,  4,  we  work  with  the 
partially  linearized  system,  which  is 

<5-4’  Un*l  "  ul  -  ^  ^  *n' 

In  fact,  the  "linearization  errors"  go  to  zero  on  any  finite 
interval  (n:  nY  <  t^}  as  y  0. 

For  the  sake  of  simplicity,  let  there  be  an  Nq  not  depending 

v 

on  T  or  Y  such  that  {<!'.,  i  f  n}  is  independent  of 
Y 

i  >  n  +  Nq)  for  each  n.  This  is  used  only  to  assure  that 

y  u 

the  sums  f I  *  defined  in  Theorem  2  have  a  finite  number  of  non- 

Y 

zero  terms.  Also,  suppose  that  is  independent  of  w(-) 

(these  assumptions  are  not  necessary,  but  simplify  the  discussion). 

In  order  to  effectively  track  changes  in  the  timing,  the  drift  terms 
{6^}  must  be  "of  the  order  of  Y"  (loosely  speaking).  In  particular, 
we  assume  that  {'I'!}  is  stationary,  with  a  covariance  not  depending 
on  Y. 

The  method  of  Theorem  2  can  now  be  applied  and  the  limit 
process  is  ~  — 

(5.5)  dU  =  -0Udt  +  VjdB, 

y 

where  (v  is  defined  in  Theorem  2) 

QO 

(5.6)  v2  =  v2  +  [E(*J)2  +  2  .1  . 

Y 

The  added  term  in  (5.6)  is  due  to  the  { /y  clock  drift  process. 

Y 

If  were  not  independent  of  w(*)i  then  there  would  be  an 

additional  "cross"  term  in  (5.6). 
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